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Some research has shown that creativity test scores are independent from IQ scores, whereas other research has shown a rela- 
tionship between the two. To clarify the cumulative evidence in this field, a quantitative review of the relationship between 
creativity test scores and IQ scores was conducted. Moderating influences of IQ tests, IQ score levels, creativity tests, cre- 
ativity subscales, creativity test types, gender, age, and below and above the threshold (IQ 120) were examined. Four hundred 
forty-seven correlation coefficients from 21 studies and 45,880 participants were retrieved. The mean correlation coefficient 
was small {r = .174; 95% Cl = .165 - .183), but heterogeneous; this correlation coefficient indicates that the relationship 
between creativity test scores and IQ scores is negligible. Age contributed to the relationship between intelligence and cre- 
ativity the most; different creativity tests contributed to it secondly. This study does not support threshold theory. 


G uilford (1962) hypothesized that creative indi- 
viduals possess divergent thinking abilities that 
traditional IQ tests do not measure, and other 
researchers have shown that creativity test scores, divergent 
thinking tasks, and creative achievement are independent 
from IQ(e.g., Getzels & Jackson, 1958; Gough, 1976; 
Guilford, 1950; Helson & Grutchfield, 1970; Helson & 
Grutchfield 1971; Herr, Moore, & Hansen, 1965; 
Rossman & Horn, 1972; Rotter, Langland & Berger, 
1971; Torrance, 1977). In seeming contrast, other research 
has shown a relationship between creativity test scores and 
IQ scores (e.g., Runco & Albert, 1986; Wallach, 1970). 

However, many researchers agree with the threshold 
theory, which explains that creativity and intelligence are 
separate constructs; that is, more intelligence does not nec- 
essarily mean greater creativity. Threshold theory assumes 
that, below a critical IQ level (which is usually said to be 
about 120), there is some correlation between IQ and cre- 
ative potential, and above it there is not (Barron, 1961; 


Getzels & Jackson, 1962; Guilford, 1967; Guilford & 
Ghristensen, 1973; MacKinnon, 1961, 1962, 1967; 
Simonton, 1994; Walberg, 1988; Walberg & Herbig, 
1991; Yamamoto, 1964). Although threshold theory sup- 
poses that intelligence and creativity are related only up 
to an IQ of approximately 120, investigations of the 
threshold theory have contradictory and inconclusive 
results (e.g., Runco & Albert, 1986). This inconsistency 
may come from different measures of intelligence and cre- 
ativity and various features of samples, including gender, 
age, and socioeconomic status (SES). 

Meta-analysis, or quantitative synthesis, attempts to 
find resolution of apparent conflicts in literature, to dis- 
cover consistencies and account variability in similarly 
appearing studies, and to identify core issues for future 
research (Gooper & Hedges, 1994). Thus, this study syn- 
thesized empirical research in the areas of creativity and 
intelligence for the purpose of creating a generalization 
about the relationship between creativity and intelligence. 
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The four primary purposes of this synthesis were to 

1. conduct a quantitative synthesis of correlations 
between IQ and creativity test scores; 

2. compare the correlations between IQ and creativity 
scores for IQ above 120 with those for IQ below 120 
to confirm threshold theory; 

3. identify some of the variables that moderate those cor- 
relations (e.g., IQ tests, different levels of IQ scores, 
creativity tests, creativity test types, creativity sub- 
scales, gender, and age); and 

4. use the correlations derived from the quantitative syn- 
thesis to investigate models of the relationships 
between intelligence and creativity. 

Methods 

Literature Review 

More than 100 studies published from 1961 through 
the summer of 2004 were located from computer searches 
and personal retrieval within the body of English-language 
creativity and intelligence literature using Academic Search 
Premier, ERIC, PsycARTICEES, PsycINFO, and biblio- 
graphic searches of each reference. IQ, creativity, intelli- 
gence, and threshold theory were the keywords used. The 
criteria for study selection included the reporting of the 
correlations among measures of intelligence and creativ- 
ity; t tests, chi-square, and Atests with a single df, as well as 
exact p values. Much of the research examined in the pre- 
sent study failed to report detailed information on proce- 
dures, results, or both, thus making it difficult or 
impossible to generate correlation coefficients. Some stud- 
ies did report on IQ scores and creativity scores, but the 
correlation coefficients that were reported were IQ and cre- 
ativity scores with various creative achievements. 

Effect Size Calculations 

A quantitative synthesis of the remaining 2 1 studies 
was conducted, which was assisted by Schwarzer’s Meta 5.3 
statistical software. Fisher’s z transformation of r was used 
for analyses in order to adjust for the nonnormal distribu- 
tion of r. Most researchers believe that studies that employ 
large samples should get more credit than those that are 
based on small samples because correlations are known to 
become more stable as sample size increases (Schwarzer, 
1991). Thus, the effect size z,. was weighted by sample size: 
The weighted mean z,. = S(Ny - 3) I S(Ny - 3), in this 
equation z^j is the Fisher z^, corresponding to any r 
(Rosenthal, 1991). Each mean rwas tested using a random 


effects model of variance, and the reported values of r are 
back transformations from 2 : (Hunter, Schmidt, & 
Jackson, 1982). A correlation coefficient was judged het- 
erogeneous when the residual standard deviation exceeded 
three fourths of the effect size and sampling error was less 
than 75% of the observed variance (Schwarzer). The resid- 
ual standard deviation was used as the standard error in 
estimating 95% confidence intervals (Cl). 

Four-hundred forty-seven correlation coefficients were 
retrieved from 21 studies, for a total sample size of 45,880 
people. Multiple correlation coefficients were obtained 
from studies that included more than one correlation using 
several creativity subscales and reported separate results 
for gender. When using several correlation coefficients per 
study in analyses of moderators, a conservative statistical 
criterion [p < .001) was used to protect against Type I error 
(Rosenthal, 1991; Rosenthal & Rubin, 1984). As reported 
above, individual correlation coefficients were weighted by 
sample sizes (Hedges & Olkin, 1985; Hunter & Schmidt, 
1990; Rosenthal) in order to adjust for sampling errors in 
r, giving more credence to studies with large samples 
because small sample sizes can increase Type II error 
(Hedges & Olkin). 

Moderator Variables 

In order to test whether correlation coefficient sizes 
vary systematically across differing levels of variables that 
are posited to influence the relationship between intelli- 
gence and creativity, analyses of moderator variables were 
conducted. Correlation coefficient size heterogeneity can 
be explained through moderator variables. The modera- 
tor variables were categorized in a manner that seemed the- 
oretically valid. The correlation coefficients between 
divergent thinking abilities and intelligence vary widely 
depending upon the divergent thinking tests, the hetero- 
geneity of the sample, and the testing conditions (Barron 
& Harrington, 1981). Therefore, factors that might mod- 
erate the estimated population correlation coefficients 
between IQ scores and creativity scores in the present 
study were considered by comparing correlation coeffi- 
cients based on various IQ tests, levels of IQ scores, sev- 
eral creativity tests, creativity test types, creativity subscales, 
gender, age, and IQ scores below and above 120. Most 
studies did not report information about subjects’ ethnic- 
ity, making it impossible to include ethnicity as a modera- 
tor variable. 

When data was entered, the creativity test subscales 
of associational fluency, ideational fluency, and verbal and 
figural fluency were classified as “fluency,” whereas sponta- 
neous flexibility, adaptive flexibility, and verbal and fig- 
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Figure 1. Stem-and-leaf-display for 447 correlation coefficients (r) between IQ and creativity scores 


Note. The correlation coefficients r ranged from -.46 in the first row of the display to +.76 in the last row. 


ural flexibility were classified as “flexibility.” Both verbal 
and flgural originality were categorized as “originality.” In 
several of the creativity tests in the present study (e.g., 
Flescher, 1963; Getzels & Jackson, 1962; Hocevar, 1980; 
Kogan & Pankove, 1972; Vernon, 1972; Wallach & 
Kogan, 1965), subscales were not reported. Thus, the tasks 
in these studies were examined and categorized based on 
the nature of those tasks. For age, 87 kindergarteners (from 
Ward, 1968) were included in the elementary group, and 
naval recruits (from Richards, 1976) and college students 
were included in the adult group. 

Hedges’ g was derived for each individual correlation 
coefficient in the moderator analysis using Johnson’s 
DSTAT 1.10 meta-analysis software (1993). When sig- 
nificant moderator effects were found, the proportion of 
variance in the variables that were accounted for by the 
moderators was determined. Variables that were significant 
moderators were entered into a weighted linear multiple 
regression model (weighted by N - 3; Rosenthal, 1991) 
using SPSS to clarify their independent effects for explain- 
ing variations in z^. Then, threshold (IQ 120) was added 
to the previous weighted linear multiple regression model 
to check the Ri change to clarify whether the threshold 
theory had its independent effect for explaining a variation 
in z^. 

Results 

A stem-and-leaf display of the 447 effects is presented 
in Figure 1, indicating a nearly normal distribution of cor- 


relation coefficients. The mean value of r was .137 (95% 
Cl = .128 - .146), or .174 (95% Cl = .165 - .183) after 
weighting by sample size. Thus, the mean correlation coef- 
ficient was judged to be small and statistically significant. 
However, the correlation coefficients were heterogeneous, 
Q(446) = 937.058, p < .0001; residual standard deviation 
was greater than one fourth of population correlation coef- 
ficient size (.044); and percent of observed variance 
accounted for by sampling error was 47.64%, which was 
less than 75%. 

Because the data set was heterogeneous, searching for 
moderators that may account for the remaining systematic 
variation in the data was necessary (see Schwarzer, 1991). 
Analyses of moderators were performed by breaking down 
the data into at least three subsets with respect to theo- 
retically relevant variables. Moderator analyses were con- 
ducted to determine whether moderators describing 
subjects or features of IQ and creativity tests might 
account for variability in the magnitude of correlation 
coefficients. Each simple contrast tested as z at p < .001. 
Correlation coefficients for each moderator are reported 
in Tables 1-8 as mean r. Variability in the magnitude of 
correlation coefficients between creativity test scores and 
IQ scores among different IQ tests [QB [6] = 170.193, p 
< .0001), different IQIevels {QB [4] = 55. 441, p < .0001), 
threshold (Q5[2] = 17.625, p < .001), different creativity 
tests (QR[3] = 203.079, A < .0001), creativity test types 
{QB [3] = 34.718, p < .0001), creativity subscales {QB 
[4] = 88.380,p < .0001), gender {QB [2] = 19.163, p < 
.0001), and age {QB [4] = 223.282, p < .0001) were sig- 
nificant. 
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Threshold 

As Table 1 shows, the mean of the 65 correlation coef- 
ficients for below IQ 120 was .201. The correlation coef- 
ficients were heterogeneous, Q(64) = 159.170,/) > .0001. 
The mean of the 14 correlation coefficients for above IQ 
120 was .235. The correlation coefficients were heteroge- 
neous, Q(13) = 45.654,/) > .0001. These heterogeneities 
observed within the different levels or classes imply that 
the mean correlation coefficients for each level or class can- 
not be adequately described with a single correlation coef- 
ficient. In other words, the variation in the relationships 
is due to one or more additional factors that the levels or 
classes do not capture. 

Post hoc contrasts revealed no statistically significant 
differences among below and above the threshold and 
unreported, likely because there were so few correlation 
coefficients for above IQ 120. On an a priori basis, IQ 
below (r = .201) and above 120 (r = .235) were not statis- 
tically different either, = 2.004,/) = .157. 

IQ Levels 

As Table 2 shows, the mean of the 32 correlation coef- 
ficients for IQ <100 was .260. The correlation coefficients 
were homogeneous, Q(31) = 44.500,/) = .134. The mean 
of the 33 correlation coefficients for 100 < IQ >120 was 
.140. The correlation coefficients were heterogeneous, 
Q(32) = 98.462,/) > .0001. The mean of the 13 correlation 
coefficients for 120 < IQ> 135 was .259. The correlation 
coefficients were heterogeneous, Q(12) = 39.841,/) < .001. 
The mean of the two correlation coefficients for IQ >135 
was -.215. The correlation coefficients were homogeneous, 
Q(l) = .234,/)= .890. 

Post hoc contrasts revealed no statistically significant 
differences among the different IQ levels. On an a priori 
basis, however, IQ < 100 (r = .260) and 100 < IQ >120 
(r = .140) were statistically different, = 16.208,/) < 
.0001. IQ< 100 (r = .260); IQ> 135 (r = -.215) were sta- 
tistically different, = 12.252,/) < .001. 100 < IQ> 120 
(r = . 140); 120 < IQ > 135 (r = .259) were statistically dif- 
ferent, %" = 18.261, /)< .0001. 120 <IQ> 135 (r=.259); 
and IQ> 135 (r = -.215) were statistically different, = 
12.307, /)< .001. 

IQ Tests 

As Table 3 shows, the mean of the 21 correlation coef- 
ficients for the Terman Concept Mastery Test (TCMT) 
was .260. The correlation coefficients were homogeneous, 
Q(20) = 41.341,/) > .001. The mean of the 1 14 correlation 


Table 1 

Threshold as a Moderator 


Threshold 

N 

(# of r) 

Mean r 

Homogeneity 

Above IQ 120 

65 

.201 

heterogeneous 

Below IQ 120 

14 

.235 

heterogeneous 

Unreported 

368.1 63heterogeneous 


Note. No statistically significant difFerences between the groups {p > .001). 
Homogeneity: heterogeneous when p < .001. 

Model for Threshold, Q5(2) = 17.625 {p < .001). 

Table 2 


IQ Levels as 

a Moderator 

IQ Eevel 

N 

(# of r) 

Mean r 

Homogeneity 

IQ< 100 

32 

.260 

homogeneous 

100 < IQ> 120 

33 

.140 

heterogeneous 

120 < IQ> 135 

13 

.259 

heterogeneous 

IQ> 135 

2 

-.215 

homogeneous 

Unreported 

367 

.162 

heterogeneous 


Note. No statistically significant difFerences between different IQ levels [p > .001). 
Homogeneity: heterogeneous when p < .00 1 . 

Model for IQ levels, Q5(4) = 55.441 (p < .0001). 


coefficients for the California Test of Mental Maturity 
(CTMM) was .191. The correlation coefficients were het- 
erogeneous, Q(113) = 240.652,/) > .0001. The mean of 
the 60 correlation coefficients for the Wechsler Intelligence 
Scale for Children (WISC) was .097. The correlation coef- 
ficients were homogeneous, Q(59) = 52.377,/) = .500. The 
mean of the 40 correlation coefficients for the School and 
College Ability Test (SCAT) was .084. The correlation 
coefficients were homogeneous, Q(39) = 33.927,/) = .515. 
The mean of the 100 correlation coefficients for the 
Sequential Tests of Educational Progress (STEP) was .096. 
The correlation coefficients were homogeneous, Q(99) = 
95.872,/) = .795. The mean of the 18 correlation coeffi- 
cients for the Peabody Picture Vocabulary Test (PPVT) 
was .018. The correlation coefficients were homogeneous, 
Q(17) = 32.922,/) = .017. 

Post hoc contrasts revealed statistically significant dif- 
ferences (/) < .0001) between CTMM and STEP. On an a 
priori basis, CTMM (r = .191) and STEP (r = .096) were 
also statistically different, = 27.943,/) < .0001. 
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Table 3 

IQ Tests as a Moderator 


IQ Test 

N 

(# of r) 

Mean r 

Homogeneity 

Contrast 

p-value 
for contrast 

TCMT 

21 

.187 

homogeneous 



CTMM 

114 

.191 

heterogeneous 

CTMM/ STEP 

< .0001 

Wise 

60 

.097 

homogeneous 



SCAT 

40 

.084 

homogeneous 



STEP 

100 

.096 

homogeneous 

CTMM/ STEP 

< .0001 

PPVT 

18 

.018 

homogeneous 



Others 

94 

.219 

heterogeneous 




Note. TCMT = Terman Concept Mastery Test; CTMM = California Test of Mental Maturity; WISC = Wechsler Intelligence Scale for Children; SCAT = School and College Ability Test; STEP = Sequential Tests of 
Educational Progress; PPVT = Peabody Picture Vocabulary Test. 

Homogeneity: heterogeneous when p < .001. 

Model for IQ tests, Q5(6) = 170.193 ( p < .0001 


Table 4 


Creativity Tests as a Moderator 


Creativity Test 

N 

(# of r) 

Mean r 

Homogeneity 

Contrast 

p-value 
for contrast 

Guilford 

64 

.250 

heterogeneous 

Guilfo rd/ Wall ach- K 

< .0001 

TTCT 

18 

.218 

homogeneous 



Wallach-K 

319 

.116 

homogeneous 

Guilford/Wallach-K 

< .0001 

Others 

46 

.242 

heterogeneous 




Note. Guilford = Guilford divergent thinking tasks; TTCT = Torrance Tests of Creative Thinking; Wallach-K = Wallach & Kogan Divergent Thinking Tasks 
Homogeneity: heterogeneous when p < .001. 

Model for Creativity tests, QB{5) = 203.079 {p < .0001). 


Creativity Tests 

As Table 4 shows, the mean of the 64 correlation coef- 
ficients for the Guilford Tests was .250. The correlation 
coefficients were heterogeneous, Q(63) = 175.953, > 
.0001. The mean of the 18 correlation coefficients for the 
Torrance Tests of Creative Thinking (TTCT) was .218. 
The correlation coefficients were homogeneous, Q(17) = 
30.252, = .035. The mean of the 319 correlation coeffi- 

cients for the Wallach- Kogan divergent thinking measures 
was .116. The correlation coefficients were homogeneous, 
Q(318) = 402.833, />> .001. 

Post hoc contrasts revealed statistically significant dif- 
ferences {p < .0001) between Guilford Tests and Wallach- 
Kogan divergent thinking measures. On an a priori basis, 
Guilford Tests {r = .250) and Wallach-Kogan divergent 
thinking measures (r = .116) were also statistically differ- 
ent, = 145.689, < .0001. 


Creativity Test Types 

As Table 5 shows, the mean of the 357 correlation 
coefficients for verbal tests was .160. The correlation coef- 
ficients were heterogeneous, Q(356) = 738. 744, p> .0001. 
The mean of the 4 1 correlation coefficients for non verbal 
tests was .226. The correlation coefficients were heteroge- 
neous, Q(40) = 84.713, < .0001. The mean of the 46 
correlation coefficients for mixed tests was -.235. The cor- 
relation coefficients were homogeneous, Q(45) = 77. 314, p 
> .001. 

Post hoc contrasts revealed no statistically significant 
differences {p > .001) between verbal tests and nonverbal 
tests. On an a priori basis, however, verbal tests {r = .160) 
and nonverbal tests (r = .226) were statistically different, 

= 11.522,p< .001. 
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Table 5 

Types of Creativity Tests 
as a Moderator 


Creativity Test 
Type 

N 

(# of r) 

Mean r 

FTomogeneity 

Verbal 

357 

.160 

heterogeneous 

Nonverbal 

41 

.226 

heterogeneous 

Mixed 

46 

.235 

homogeneous 

Unreported 

3 

.068 

homogeneous 


Note. No statistically significant dififerences between difiFerent Creativity Test Types {j} > .001). 
Homogeneity: heterogeneous when^ < .001. 

Model for Creativity Test Types, QS(3) = 34.718 {p < .0001). 


Creativity Subscales 

As Table 6 shows, the mean of the 175 correlation 
coefficients for originality was .131. The correlation coef- 
ficients were heterogeneous, Q(174) = 267.721,y»> .0001. 
The mean of the 1 84 correlation coefficients for fluency 
was .170. The correlation coefficients were heterogeneous, 
Q(183) = 370.085,71 > .0001. The mean of the six corre- 
lation coefficients for figural redefinition was .362. The 
correlation coefficients were homogeneous, Q(5) = 2.531, 
p = .865. The mean of the 25 correlation coefficients for 
flexibility was .231. The correlation coefficients were het- 
erogeneous, Q(24) = 82.439,71 > .0001. 

Post hoc contrasts revealed statistically significant dif- 
ferences {p < .0001) between originality and figural redef- 
inition, between originality and flexibility, and between 
fluency and figural redefinition. On an a priori basis, orig- 
inality {r = .131) and fluency (r = .170) were statistically 
different, = 13. 241, 71 < .001. Originality (r = .131) and 
figural redefinition (r = .362) were statistically different, 
= A7.Vn,p < .0001. Originality (r = .131) and flexibil- 
ity {r = .231) were statistically different, = 38.477, 71 < 
.0001. Fluency {r = .170) and figural redefinition (r = 
.362) were statistically different, = ’55.766, p < .0001. 
Fluency (r = .170) and flexibility {r = .231) were statisti- 
cally different, = 15. 143, p < .0001. Figural redefini- 
tion (r = .362) and flexibility (r = .231) were statistically 
different, = 14.624, 7» < .001. 

Gender 

As Table 7 shows, the mean of the 186 correlation 
coefficients for males was .149. The correlation coefficients 
were heterogeneous, Q(185) = All.lll, p > .0001. The 
mean of the 180 correlation coefficients for females was 


.159. The correlation coefficients were homogeneous, 
Q(179) =213.439,71 = .087. 

Post hoc contrasts revealed no statistically significant 
differences {p = .736) between males and females. On an 
a priori basis, males (r = .149) and females (r = .159) were 
not statistically different either, = .614,71 = .433. 

Age 

As Table 8 shows, the mean of the 251 correlation 
coefficients for the elementary school group was .086. The 
correlation coefficients were homogeneous, Q(250) = 

260.451.71 = .660. The mean of the 27 correlation coeffi- 
cients for the middle school group was .210. The correla- 
tion coefficients were heterogeneous, Q(26) = 70.097, 71 > 
.0001. The mean of the 105 correlation coefficients for the 
high school group was .261. The correlation coefficients 
were heterogeneous, Q(104) = 163.761, 71 > .001. The 
mean of the 53 correlation coefficients for adults was .205. 
The correlation coefficients were heterogeneous, Q(52) = 

183.404.71 > .0001. 

Post hoc contrasts revealed statistically significant dif- 
ferences (p < .0001) between the elementary school group 
and the middle school group, between the elementary 
school group and the high school group, and between the 
elementary school group and adults. On an a priori basis, 
the elementary school group (r = .086) and the middle 
school group {r = .210) were statistically different, = 

59.051.71 < .0001. The elementary school group (r= .086) 
and the high school group (r = .261) were statistically dif- 
ferent, = 1 10.148,71 < .0001. The elementary school 
group (r = .086) and adults (r = .205) were statistically 
different, = \26.9A5,p < .0001. The high school group 
(r = .261) and adults (r = .205) were statistically different, 
X"= 12.036, 7i< .001. 

Multiple Regression Analysis 

Variables that were statistically significant moderators 
(based on the results of post hoc contrasts) were entered 
into a weighted linear multiple regression model to deter- 
mine their independent effects for explaining variation in 
the magnitude of correlation coefficients. The results of 
the direct entry of the four statistically significant moder- 
ating variables of IQ tests, creativity tests, creativity sub- 
scales, and age into a weighted multiple linear regression 
analysis indicated that age and creativity subscales inde- 
pendently accounted for variation in z,.. The regression 
model yielded R?. = .260, adjusted R?. = .235, F (221) = 
10.716, 7> < .0001. Next, the threshold variable was added 
to the previous weighted multiple linear regression analysis 
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Table 6 

Creativity Subscales as a Moderator 


Creativity Subscale 

N 

(# of r) 

Mean r 

Homogeneity 

Contrast 

/)-value 
for contrast 

Originality 

175 

.131 

heterogeneous 

ac, ad 

< .0001 

Fluency 

184 

.170 

heterogeneous 

be 

< .0001 

Figural Redefinition 

6 

.362 

homogeneous 

ac, be 

< .0001 

Flexibility 

25 

.231 

heterogeneous 

ad 

< .0001 

General Creativity 

57 

.206 

heterogeneous 




Note. Homogeneity: heterogeneous when p < .001. 

Mode! for Creativity Subscales, QB{A) = 88.380 {p < .0001). 


model. Adding the threshold variable did not change the 
significantly. The regression model yielded Ri Change = 
.004, AChange (1) = 1.269, A = -261. 

According to Johnson (1993), the tests of meta- ana- 
lytic predictor’s significance are inappropriate because the 
error degrees of freedom in the weighted regression proce- 
dure of SPSS are too large. Thus, the appropriate correc- 
tions in its regression model were made using DSTAT. The 
results of the weighted multiple linear regression of effect 
size on moderator variables are presented in Table 9. The 
various creativity tests ((3 = -.01179, z = -2.33020,/) < 
.001) and age ((3 = .03917, z = 6.19332,/) < .0001) had 
significant effects on the magnitude of the correlation coef- 
ficients between creativity test scores and IQ test scores. 
Various IQ tests ((3 = -.00170, 2 = -.53790,/) = .59.64) and 
creativity subscales ((3 = .01647, z = 3.72019,/) = .01980) 
were uncorrelated with the magnitude of the correlation 
coefficients. The results of the weighted multiple linear 
regression of effect size with the threshold variable are 
presented in Table 10. The threshold variable ((3 = - 
.00726, z = -.76628,/) = .44351) was not related to the 
magnitude of the correlation coefficients. 

Discussion 

The quantitative synthesis of the literature results indi- 
cate that the relationship between IQ scores and creativity 
scores is small and positive, in which the mean effect r 
was .174 (95% Cl, .165 - .183). The correlation coeffi- 
cients were heterogeneous, but the threshold of IQ 120, 
which was examined as one of the possible moderators, 
could not explain variance in the studies’ correlation coef- 
ficients. Further, when the IQ scores were divided into 
the four levels (1Q< 100 [r = .260]; 100 < 1Q> 120 [r = 
.140]; 120 < 1Q> 135 [r= .259]; 1Q> 135 [r= -.215]), 


Table 7 

Gender as a Moderator 


Gender Group 

N 

(# of r) 

Mean r 

Homogeneity 

Male 

186 

.149 

heterogeneous 

Female 

180 

.159 

homogeneous 

Combined 

81 

.193 

heterogeneous 


Note. No statistically significant differences between male and female groups ip > .001). 
Homogeneity: heterogeneous when p < .001. 

Model for Gender, QB{2) = 19.163 ip < .0001). 


there were no statistically significant differences among the 
levels according to the results of the post hoc contrasts, 
although IQ >135 had a negative mean correlation coef- 
ficient, r = -.215. Thus, the threshold theory was not sup- 
ported. 

The results of the post hoc contrasts revealed that IQ 
tests, creativity tests, creativity subscales, and age explained 
the differences in correlation coefficients between IQ and 
creativity tests scores. However, the variance in the mag- 
nitude of the correlation coefficients was not significantly 
explained by the various IQ tests, subscales of the creativ- 
ity tests, creativity test types, and gender, but it was sig- 
nificantly explained by different creativity tests {p < .001) 
and age {p < .0001) according to the results of the 
weighted multiple linear regression, which determines 
moderators’ independent effects for explaining variation. 

Post hoc contrasts revealed a statistically significant 
difference between Guilford Tests and Wallach-Kogan 
divergent thinking measures. According to the results of 
the weighted multiple linear regression, various creativity 
tests had significant independent effects on the magni- 
tude of the correlation coefficients between creativity test 


Winter/Spring 2005, Volume XVI, Number 2/3 63 


Kim 


Table 8 


Age as a Moderator 


Age Group 

N Mean r 

(# of r) 

Homogeneity 

Gontrast 

p-Yslnt 
for contrast 

Elementary 

251 .086 

homogeneous 

ab, ac, ad 

< .0001 

Middle 

27 .210 

heterogeneous 

ab 

< .0001 

High 

105 .261 

heterogeneous 

ac 

< .0001 

Adult 

53 .205 

heterogeneous 

ad 

< .0001 

Unreported 

1 1 .267 

heterogeneous 



Note. Homogeneity: heterogeneous when p < .001. 




Model for Age, QB{A) = 223.282 {p < .0001). 





Table 9 


Table 10 


Multiple Linear Regression 

Multiple Linear Regression 

of Effect 

of Effect Size With Threshold 

Size Zr on 

Moderator Variables 

(Weighted by Sample Size) 

(Weighted by Sample Size) 

Moderator 

Standardized 2 :-vaIue p-value 

Moderator Standardized z-vslnt 

p-value 

IQ Tests 

-.00170 -.53790 .59064 

IQ Tests 

.00085 -.26862 

.78822 

Greativity Tests 

-.01179 -2.33020 <.001 

Greativity Tests 

-.01143 -2.26266 

< .001 

Greativity Subscales .01647 3.72019 .01980 

Greativity Subscales 

.01638 3.70578 

.02366 

Age 

.03917 6.19332 <.0001 

Age 

.03976 6.29666 

< .0001 



Threshold 

-.00726 -.76628 

.44351 

Note. Overall regression effect = 

: 48.96, df= A,p < .0001 (two-tailed). 






Note. Overall regression effect = 49.565, df='^,p< .0001 (two-tailed). 



scores and IQ test scores. The mean weighted correlation 
coefficients between IQ test scores and Wallach-Kogan 
divergent thinking measure scores, r = .116, was much 
smaller than the mean weighted correlation coefficients 
between IQ tests scores and Guilford Test scores, r = .250. 
This might be because Wallach-Kogan divergent thinking 
measures were administered as a game-like activity, 
whereas the Guilford Tests were administered as tests. 

Wallach and Kogan (1965) concluded that traditional 
intelligence test scores are independent from creativity test 
scores when the creativity tests are assessed in a game-like, 
nonevaluative context. Some other studies also support 
this conclusion (e.g., Gropley & Maslany 1969; Iscoe & 
Pierce -Jones, 1964; Kogan & Pankove, 1972; Pankove & 
Kogan, 1968; Wallach & Wing, 1969; Ward, 1968). 
Torrance (1966) also recommended the creation of a 
game-like, thinking, or problem-solving atmosphere in 
order to avoid the threatening situation associated with 
testing. His intent was to set a tone so that the examinees 
would expect to enjoy the activities. 


Examinees should be encouraged to “have fun,” and 
they should experience a psychological climate that is as 
comfortable and stimulating as possible (see Kim, in 
press). Thus, according to the TTGT manual (Ball & 
Torrance, 1984), administrators of the tests should invite 
the examinees to enjoy the activities and view the tests as 
a series of fun activities, thereby reducing test anxiety. 
Torrance (1987) suggested a warm-up activity before 
administration of the TTGT for arousing the incubation 
processes and increasing motivation, thereby enabling cre- 
ative energy. Despite Torrance’s recommendation, how- 
ever, none of the studies using the TTGT included in the 
present study reported that the TTGT was administered 
in a game-like testing context. Thus, how many adminis- 
trators of the TTGT included in this study invited the 
examinees to enjoy the activities cannot be known. That 
might be why the mean correlation coefficient of the 
TTGT was .218, which was not significantly different 
from either test-like Guilford Tests (r = .250) or game-like 
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Wallach- Kogan divergent thinking measures (r = .116). 
Therefore, it can be concluded that, when creativity tests 
are administered in a game-like testing context, the cre- 
ativity test scores have smaller relationships with IQ test 
scores. 

The variance in the magnitude of the correlation coef- 
ficients was also significantly explained by the variance 
among age groups. For older groups (middle school, high 
school, and adult groups), IQ scores were more closely 
associated with creativity scores than within the younger 
group (kindergarten through fifth-grader group). Given 
that even the same tests may measure different processes 
when administered to subjects of differing ages (Ward, 
1968), creativity tests may also measure different con- 
structs among various ages. This is supported by previous 
studies (Kim, 2004; Kim, Cramond, & Bandalos, 2004, in 
press) that show that the latent structure of the TTCT is 
more invariant across gender than across grade groups. 
The relationship between creativity and IQ scores among 
younger children might be smaller because of less educa- 
tional influence over the use of their cognitive abilities, as 
compared with older people. 

In conclusion, the negligible relationship between cre- 
ativity and IQ scores indicates that even students with low 
IQ scores can be creative. Therefore, teachers should be 
aware of characteristics of creative students — this will 
enable teachers to see the potential of each child. In con- 
trast with the threshold theory, neither IQ 120 nor differ- 
ent levels of IQ scores explained variation in the 
correlation coefficients. The differences in the correlation 
coefficients between IQ scores and creativity test scores 
were not significantly explained by the various IQ tests, 
subscales of the creativity tests, or creativity test types, 
whereas different creativity tests and age were consistent 
moderator variables. The significantly independent mod- 
erator effect for creativity tests indicates that, when cre- 
ativity tests are administered in a game-like context, the 
creativity test scores have smaller relationships with IQ test 
scores than when creativity tests are administered in a test- 
like context. The significantly independent moderator 
effect for age, in which IQ scores were more closely asso- 
ciated with creativity scores for younger groups than for 
older groups, indicates less educational influence on stu- 
dents’ use of their cognitive abilities for younger groups 
compared with older groups. 

However, the findings of the present study in terms 
of the threshold theory are limited in generalizability 
because, for the 368 of the 447 correlation coefficients, the 
subjects’ IQ scores were unreported or unable to be used. 
For future studies, more studies that have reported IQ 
scores are needed. 
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